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(57) Abstract 

A method for isolating mRNAs as cDNAs employs a polymerase amplification method using at least two oligodeoxynu- 
cleotide primers. In one approach, the first primer contains sequence capable of hybridizing to a site immediately upstream of the 
first A ribonucleotide of the mRNA's polyA tail and the second primer contains arbitrary sequence. In another approach, the 
first primer contains sequence capable of hybridizing to a site including the mRNA's polyA signal sequence and the second pri- 
mer contains arbitrary sequence. In another approach, the first primer contains arbitrary sequence and the second primer con- 
tains sequence capable of hybridizing to a site including the Kozak sequence. In another approach, the first primer contains a se- 
quence that is substantially complementary to the sequence of a mRNA having a known sequence and the second primer contains 
arbitrary sequence. In another approach, the first primer contains arbitrary sequence and the second primer contains sequence 
that is substantially identical to the sequence of a mRNA having a known sequence. The first primer is used as a primer for re- 
verse transcription of the mRNA and the resultant cDNA is amplified with a polymerase using both the first and second primers 
as a primer set. 
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"Methods to clone mRNA". 

Background of the Invention 

This application is a contmuation-in-part of the co-pending application U.S. 
Serial No. 07/850,343, filed on March 11, 1992. 
5 This invention relates to methods of detecting and cloning of individual 

mRNAs. 

The activities of genes in cells are reflected in the kinds and quantities of 
their mRNA and protein species. Gene expression is cracial for processes such 
as aging, development, differentiation, metabolite production, progression of the 

10 cell cycle, and infectious or genetic or other disease states. Identification of the 
e:qpressed mRNAs will be valuable for the elucidation of their molecular 
mechanisms, and for applications to the above processes. 

Mammalian cells contain approximately 15,000 different mRNA sequences, 
however, each mRNA sequence is present at a different frequency within the 

15 cell. Generally, mRNAs are expressed at one of three levels. A few "abundant" 
mRNAs are present at about 10,000 copies per cell, about 3,000-4,000 
"intermediate" mRNAs are present at 300-500 copies per cell, and about 11,000 
"low-abundance" or "rare" mRNAs are present at approximately 15 copies per 
cell. The numerous genes that are represented by intermediate and low 

20 fi^juencies of their mRNAs can be cloned by a variety of well established 
techniques (see for example Sambrook et aly 1989, Molecular Cloning: A 
Laboratory Manual, Second Edition, Cold Spring Harbor Press, pp. 8.6-8.35). 

If some knowledge of the gene sequence or protein is had, several direct 
cloning methods are available. However, if the identity of the desired gene is 

25 unknown one must be able to select or enrich for the desired gene product in 
order to identify the "unknown" gene without expending large amounts of time 
and resources. 

The identification of unknown genes can often involve the use of 
subtractive or differential hybridization techniques, Subtractive hybridization 
30 techniques rely upon the use of very closely related cell populations, such that 
differences in gene expression wiQ primarily represent the gene(s) of interest. A 
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key element of the subtractive hybridization technique is the construction of a 
comprehensive complementary-DNA ("cDNA") Kbrary. 

The construction of a comprehensive cDNA library is now a fairly routine 
procedure. PolyA mRNA is prepared from the desired cells and the first strand 
5 of the cDNA is synthesized using RNA-dependent DNA polymerase ("reverse 
transcriptase") and an oligodeoxynucleotide primer of 12 to 18 thymidine 
residues. The second strand of the cDNA is synthesized by one of several 
methods, the more efficient of which are commonly known as '^replacement 
synthesis" and "primed synthesis". 

10 Replacement synthesis involves the use of ribonuclease H ("RNAase H"), 

which cleaves the phosphodiester backbone of RNA that is in a RNAiDNA 
hybrid leaving a 3' hydroxyl and a 5' phosphate, to produce nicks and gaps in 
the mRNA strand, creating a series of RNA primers tttat are used by E. coli 
DNA polymerase I, or its "Klenow" fragment, to synthesize the second strand of 

15 the cDNA. This reaction is very efficient; however, the cDNAs produced most 
often lack the 5' terminus of the mRNA sequence. 

Primed synthesis to generate the second cDNA strand is a general name for 
several methods which are more difficult than rq)lacement synthesis yet clone the 
5'^ terminal sequences with high efficiency. In general, after the synthesis of the 

20 first cDNA strand, the 3' end of the cDNA strand is extended with terminal 
transferase, an engone which adds a homopolymeric "tail" of deoxynucleotides^ 
most commonly deoxycytidylate. This tail is then hybridized to a primer of 
oligodeoxyguanidylate or a synthetic ftagment of DNA with an deoxyguanidylate 
tail and the second strand of the cDNA is synthesized using a DNA-dependent 

25 DNA polymerase. 

The primed synthesis method is effective, but the method is laborious, and 
an resultant cDNA clones have a tract of deoxyguanidylate immediately upstream 
of the mRNA sequence. This deoxyguanidylate tract can interfere with 
transcription of the DNA in vitro or in ^dvo and can interfere with the sequendng 

30 of the clones by the Sanger dideoxynucleotide sequencing method. 

Once both cDNA strands have been synthesized, the cDNA library is 
constructed by cloning the cDNAs into an appropriate plasmid or viral vector. 



In practice this can be done by directly Hgating the blunt ends of the cDNAs into 
a vector which has been digested by a restriction endomiclease to produce blunt 
ends. Blunt end ligations are very inefficient, however, and this is not a 
common method of choice. A generaUy used method involves adding synthetic 
linkers or adapters containing restriction endonuclease recognition sequences to ' 
the ends of the cDNAs. The cDNAs can then be cloned into the desired vector 
at a greater efficiency. 

Once a comprehensive cDNA Ubrary is constructed from a cell line, 
desired genes can be identified with the assistance of subtractive hybridization 
(see for example Sargent T.D., 1987, MetK EnzymoL, Vol. 152, pp. 423-432; 
Lee et al, 1991, Proc. Natl. Acad. Sd.. USA, Vol. 88, pp. 2825-2830). A 
general method for subtractive hybridization is as foUows. The complementary 
strand of the cDNA is synthesized and radiolabelled. This single strand of 
cDNA can be made from polyA mKNA or from the existing cDNA library. The 
radiolabelled cDNA is hybridized to a large excess of mliNA from a closely 
related cell population. After hybridization tiie cDNA:mKNA hybrids are 
removed from the solution by chromatography on a hydroxylapatite column. The 
remaining "subtracted" radiolabelled cDNA can then be used to screen a cDNA 
or genomic DNA library of the same cell population. 

Subtractive hybridization removes tiie majority of the genes expressed in 
both cell populations and tiius enriches for genes which are present only in tiie 
desired ceU population. However, if tiie e^qjression of a particular mRNA 
sequence is only a few times more abundant in tiie desired ceU population flian 
tiie subtractive population it may not be possftle to isolate tiie gene by 
subtractive hybridization. 

Summary of the Inyention 
We have discovered a metiiod for identifying, isolating and cloning 
mRNAs as cDNAs using a polymerase amplification metiiod tiiat employs at least 
two oUgodeoxynucleotide primers. In one approach, tiie first primer contains 
sequence enable of hybridizing to a site including sequence tiiat is immediately 
upstream of tiie first A ribonucleotide of tiie mRNA's polyA tail and tiie second 
primer contains arbitrary sequence. In another approach, tiie first primer 



contains sequence capable of hybridizing to a site including the mKNA's pcIyA 
signal sequence and the second primer contains arbitrary sequence. In another 
approach, the first primer contains arbitrary sequence and the second primer 
contains sequence enable of hybridizing to a site including the mRNA's Kozak 
sequence. In another approach, the first primer contains a sequence that is 
substantially complementary to the sequence of a mKNA having a known 
sequence and the second primer contains arbitrary sequence. Li another 
approach, the first primer contams arbitrary sequence and die second primer 
contains sequence that is substantially identical to the sequence of a mRNA 
having a known sequence. The first primer is used as a primer for reverse 
transcription of the mRNA and the resultant cDNA is amplified with a 
polymerase using both the first and second primers as a primer set. 

Using this method with different pairs of the alterable primers, virtually 
any or all of the mRNAs from any cell type or any stage of the ceU cycle, 
including very low abundance mENAs, can be identified and isolated. 
Additionally a comparison of the mENAs firom closely related cells, which may 
be for example at different stages of development or different stages of the cell 
cycle, can show which of the mRNAs are constitutively expressed and which are 
differentially expressed, and their respective frequencies of expression. 

The "first primer" or "first oligodeoxynucleotide" as used herein is defined 
as bemg the oligodeoxynucleotide primer that is used for the reverse transcription 
of the mRNA to make the first cDNA strand, and then is also used for 
amplification of the cDNA. The first primer can also be referred to as the 3' 
primer, as this primer will hybridize to the mRNA and will define the 3' end of 
the first cDNA strand. The "second primer" as used herein is defined as being 
the oligodeoxynucleotide primer that is used to make the second cDNA strand, 
and is also used for the amplification of the cDNA. The second primer may also 
be referred to as the 5' primer, as this primer will hybridize to the first cDNA 
strand and will define the 5' end of the second cDNA strand. 

The "arbitrary" sequence of an oligodeoxynucleotide primer as used herem 
is defined as being based upon or subject to individual judgement or discretion. 
In some instances, the arbitrary sequence can be entirely random or partly 
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random for one or more bases. In other instances the aibitraiy sequence can be 
selected to contain a specific ratio of each deoxynucleotide, for example 
approximately equal proportions of each deoxynucleotide or predominantly one 
deoxynucleotide, or to not contain a specific deoxynucleotide. The aAitraiy 
sequence can be selected to contain, or not to contain, a recognition site for 
specific restriction endonuclease. The arbitrary sequence can be selected to 
either contain a sequence that is substantiaUy identical (at least 50 homologous) 
to a mRNA of known sequence or to not contain sequence from a mRNA of 
known sequence. 

An oligodeoxynuceotide primer can be either "complementaiy" to a 
sequence or "substantially identical" to a sequence. As defined herein, a 
complementarjr oUgodeoxynucleotide primer is a primer that contains a sequence 
which will hybridize to an mRNA, that is the bases are complementary to each 
other and a reverse transci^tase will be able to extend the primer to fonn a 
cDNA strand of the mRNA. As defined herein, a substantiaUy identical primer 
is a primer that contains sequence which is the same as the sequence of an 
mRNA, that is greater than 50% identical, and the primer has the same 
orientation as an mRNA thus it will not hybridize to, or complement, an mRNA 
but such a primer can be used to hybridize to the first cDNA strand and can be 
extended by a polymerase to generate the second cDNA strand. The teims of art 
"hybridization" or "hybridize", as used herein, are defined to be the base pairing 
of an oUgodeoxynucleotide primer with a mRNA or cDNA strand. The 
"conditions under which" an oUgodeoxynucleotide hybridizes with an mRNA or 
a cDNA, as used herein, is defined to be temperature and buffer conditions (that 
are described later) under which the base pairing of the oUgodeoxynucleotide 
primer with either an mRNA or a cDNA occurs and only a few mismatches (one 
or two) of the base pairing are permissible. 

An oUgonucleotide primer can contain a sequence that is known to be a 
"consensus sequence" of an mRNA of known sequence. As defined herein, a 
"consensus sequence" is a sequence that has been found in a gene fenuly of 
proteins having a sinular function or similar properties. The use of a primer that 



inclndes a consensus sequence may result in the cloning of additional membeis of 
a desired gene family. 

The "preferred length" of an oKgodeoxynucleotide primer, as used herein, 
is determined from the desired specificity of annealing and the number of 
oligodeo^gmucleotides having the desired spedfidty that are required to hybridize ' 
to all the mRNAs in a ceU. An oligodeoxynucleotide primer of 20 nucleotides is 
more specific than an oKgodeoxynucleotide primer of 10 nucleotides; however, 
addition of each random nucleotide to an oligodeoxynucleotide primer increases 
by four the number of oligodeoxynucleotide primers required in order to 
hybridize to every toRNA in a cell 

In one aspect, in general, the invention features a method for identifying 
and isolating mRNAs by priming a preparation of mRNA for reverse 
transcription witii a first oKgodeoxynucleotide primer tiiat contains sequence 
capable of hybridizing to a site including sequence that is immediately upstream 
of die first A ribonucleotide of die mRNA's polyA tail, and ampK^g die 
cDNA by a polymerase ampKfication mediod using die first primer and a second 
oKgodeoxynucleotide primer, for example a. primer having arbitrary sequence, as 
a primer set 

In preferred embodiments, die first primer contains at least 1 nucleotide at 
die 3' end of die oKgodeoxynucleotide diat can hybridize to an mENA sequence 
diat is immediately upstream of die polyA tail, and contains at least 11 
nucleotides at the 5' end diat wiU hybridize to die polyA tail. The entire 3' 
oKgodeoxynucleotide is preferably at least 13 nucleotides in lengdi, and can be 
iq) to 20 nucleotides in length. 

Most preferably, die first primer contains 2 nucleotides at the 3' end of die 
oKgodeoxynucleotide diat can hybridize to an mRNA sequence diat is 
immediately upstream of die polyA tail. Preferably, die 2 polyA-non- 
complementary nucleotides are of die sequence VN, where V is deoxyadenylate 
("dA"), deoxyguanylate ("dG"), or deoxycytidylate ("dC"), and N, die 3' 
terminal nucleotide, is dA, dG, dC, or deoxythymidylate ("dT"). Thus die 
sequence of a preferred first primer is S'-iiiimTiTi VN [Seq. ID. No. 1]. 
The use of 2 nucleotides can provide accurate positioning of die first primer at 



the junction between the mRNA and its polyA tail, as the properly aligned 
oligodeoxynucleotide:niRNA hybrids are more stable than improperly aligned 
hybrids, and thus the properly aligned hybrids wiU form and remain hybridized 
at higher temperatures. In preferred applications, the mRNA sample will be 
5 divided into at least twelve aliquots and one of the 12 possible VN sequences of ' 
the first primer wiU be used in each reaction to prime the reverse transcription of 
tiie mRNA. The use of an oligodeoxynucleotide with a single sequence will 
reduce tiie number of mRNAs to be analyzed in each sample by binding to a 
subset of tiie mRNAs, statisticaUy l/12tii, tiius simpliiying tiie identification of 
10 the mRNAs in each sample. 

In some embodiments, tiie 3' end of tiie first primer can have 1 nucleotide 
tiiat can hybridize to an mRNA sequence tiiat is immediately upstieam of tiie 
polyA tail, and 12 nucleotides at the 5' end tiiat will hybridize to tiie polyA tail, 
tiius tiie primer wiU have tiie sequence 5'-TTrrTTTrmTV [Seq. ED. No. 2]. 
15 The use of a single non-polyA-complementaty deoxynucleotide would decrease 
tiie number of oligodeoxynucleotides tiiat are required to identify eveiy mRNA to 
3, however, tiie use of a single nucleotide to position the annealing of primer to 
the junction of the mRNA sequence and flie polyA tail may r^ult in a 
significant loss of specificity of tiie annealing and 2 non-polyA-complementaiy 
nucleotides are preferred. 

In some embodiments, tiie 3' end of tiie first primer can have 3 or more 
nucleotides tiiat can hybridize to an mRNA sequence tiiat is immediately 
upstream of tiie polyA tail. The addition of each nucleotide to tiie 3' end will 
further increase tiie stability of properly aligned hybrids, and tiie sequence to 
hybridize to tiie polyA taH can be decreased by one nucleotide for each additional 
non-polyA-complementaiy nucleotide added. The use of such a first primer may 
not be practical for n^id screening of tiie mRNAs contained within a given cell 
line, as tiie use of a first primer witii more tiian 2 nucleotides tiiat hybridize to 
tiie mRNA immediately upstream of tiie polyA tail significantiy increases tiie 
number of oligodeoxynucleotides required to identify every mRNA. For 
instance, tiie primer 5'-iri riTmTVNN [Seq. ID. No. 3] would require tiie 
use of 48 separate first primers in order to bind to every mRNA, and would 



significantly increase the number of reactions required to screen the mRNA ftom 
a given cell line. The use of oligodeoxynucleotides with a single random 
nucleotide in one position as a group of four can circumvent the problem of 
needing to set up 48 separate reactions in order to identify every mRNA. 
However as the non-polyA-complementaiy sequence became longer, it would 
quickly become necessary to increase the number of reactions required to identify 
every mRNA. 

In preferred embodiments, the second primer is of arbitrary sequence and 
is at least 9 nucleotides in length. Preferably the second primer is at most 13 
nucleotides in length and can be up to 20 nucleotides in lengtii. 

Li another aspect, in general, the invention features a method for preparing 
and isolating mRNAs by priming a preparation of mRNA for reverse 
transcription with a first primer that contains a sequence capable of hybridizing 
to the polyadenylation signal sequence and at least 4 nucleotides that are 
positioned 5', or 3', or both of the polyadenylation signal sequence; this entire 
first primer is preferably at least 10 nucleotides in length, and can be iq) to 20 
nucleotides in length. In one preferred embodiment the sequence 5'- 
NNTTTATrNN [Seq. ID. No. 4] can be chosen such that the sequence is 5'- 
GCTTTATTNC [Seq. ID. No. 5], and the four resultant primers are used 
together in a single reaction for the priming of the mRNA for reverse 
transcription. Once the first cDNA strand has been formed by reverse 
transcription then the first primer can be used with a second primer, for example 
and arbitrary sequence primer, for the amplification of the cDNA. 

In one aspect, in general, the invention features a method for identifying 
and isolating mRNAs by priming a preparation of mRNA for reverse 
transcription with a first oligodeoxynucleotide primer to generate a first cDNA 
strand, and priming the preparation of the second cDNA strand with a second 
primer that contains sequence substantially identical to the Kozak sequence of 
mRNA, and amplifying the cDNA by a polymerase amplification method using 
the first and second primers as a primer set 

In preferred embodiments, the first and second primers are at least 9 
deoxynucleotides in length, and are at most 13 nucleotides in length, and can be 



up to 20 nucleotides in length. Most preferably the first and second piimers are 
10 deoxynucleotides in length. 

In preferred embodiments the sequence of the first primer is selected at 
random, or the first primer contains a selected aibitraiy sequence, or the first 
primer contains a restriction endonuclease recognition sequence. 

In preferred embodiments the sequence of the second primer that contains 
sequence substantially identical to the Kozak sequence of mRNA has the 
sequence NNNANNATGN [Seq. ID No. 6], or has the sequence 
NNNANNATGG [Seq. ID No. 7]. Where N is any of the four 
deoxynucleotides. Prefaably, the second primer has the sequence 
GCCACCATGG [Seq. ID No. 8]. In some embodiments the first primer may 
further include a restriction endonuclease recognition sequence that is added to 
either the 5' or 3' end of the primer increasing the length of the primer by at 
least 5 nucleotides. 

In another aspect, in general, the invention features a method for 
identifying and isolating mRNAs by priming a preparation of mRNA for reverse 
transcription with a first oligodeoxynucleotide primer that contains sequence that 
is substantially complementary to the sequence of a mRNA ha^dng a known 
sequence, and priming the preparation of the second cDNA strand with a second 
primer and, amplifying the cDNA by a polymerase amplification method using 
the first and second primers as a primer set. 

In preferred embodiments, the first and second primers are at least 9 
deoxynucleotides in length, and are at most 13 nucleotides in length, and can be 
up to 20 nucleotides in length. Most preferably the first and second primers are 
10 deoxynucleotides in length. 

In preferred embodiments the sequence of the first primer further includes 
a restriction endonuclease sequence, which may be included within the preferred 
10 nucleotides of the primer or may be added to either the 3' or 5' end of the 
primer increasing the length of the oligodeoxynucleotide primer by at least 5 
nucleotides. 
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Li preferred embodiments the sequence of the second primer is selected at 
random, or the second primer contains a selected arbitrary sequence, or the 
second primer contains a restriction endonuclease recognition sequence. 
In another aspect, in general, the invention features a method for 
5 identifying and isolating mRNAs by priming a preparation of mRNA for reverse ' 
transcription with a first oUgodeoxynucleotide primer, and priming the 
preparation of the second cDNA strand with a second primer that contains 
sequence that is substantially identical to the sequence of a mRNA having a 
known sequence and, amplifying the cDNA by a polymerase amplification 

10 method using the first and second primers as a primer set. 

In preferred embodiments, the first and second primers are at least 9 
deoxynucleotidejs in length, and are at most 13 nucleotides in length, and can be 
up to 20 nucleotides in length. Most preferably the first and second primers are 
10 deoxynucleotides in length. 

15 In preferred embodiments the sequence of the first primer is selected at 

random, or the first primer contains a selected arbitrary sequence, or the first 
primer contains a restriction endonuclease recognition sequence. 

in preferred embodiments the sequence of the second primer having a 
sequence that is substantially complementary to the sequence of an mRNA having 

20 a known sequence further includes a restriction endonuclease sequence, which 
may be included within the preferred 10 nucleotides of the primer or may be 
added to either the 3' or 5' end of the primer increasing the length of the 
oligodeojqmucleotide primer by at least 5 nucleotides. 

In another aspect, in general, the invention features a method for 

25 identifying and isolating mRNAs by priming a preparation of mRNA for reverse 
transcription with a first oUgodeoxynucleotide primer that contains sequence that 
is substantially complementary to the sequence of a mRNA having a known 
sequence, and priming the prq)aration of the second cDNA strand with a second 
primer that contains sequence that is substantially identical to the Kozak sequence 

30 of mRNA, and amplifying the cDNA by a polymerase amplification method 
using the first and second primers as a primer set. 
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In preferred embodiments, the first and second primers are at least 9 
deoxynucleotides in length, and are at most 13 nucleotides in length, and can be 
up to 20 nucleotides in length- Most preferably the first and second primers are 
10 deoxynucleotides in length. 

In some preferred embodiments of each of the general aspects of the 
invention, the amplified cDNAs are separated and then the desired cDNAs are 
reamplified using a polymerase amplification reaction and the first and second 
oligodeoxynucleotide primers. 

In preferred embodiments of each of the general aspects of the invention, a 
set of first and second oligodeoxynucleotide primers can be used, consisting of 
more than one of each primer. In some embodiments more than one of the first 
primer will be^included in the reverse transcription reaction and more than one 
each of the first and second primers will be included in the amplification 
reactions. The use of more than one of each primer will increase the number of 
mRNAs identified in each reaction, and the total number of primers to be used 
will be determined based upon the desired method of separating the cDNAs such 
that it remains possible to fully isolate each individual cDNA. In preferred 
embodiments a few hundred cDNAs can be isolated and identified using 
denaturing polyacrylamide gel electrophoresis. 

The method according to the invention is a significant advance over current 
cloning techniques that utilize subtractive hybridization. In one aspect, the 
method according to the invention enables the genes which are altered in their 
frequency of expression, as well as of mRNAs which are constitutively and 
differentially expressed, to be identified by simple visual inspection and isolated. 
In another aspect the method according to the invention provides specific 
oligodeoxynucleotide primers for amplification of the desired mRNA as cDNA 
and makes unnecessary an intermediary step of adding a homopolymeric tail to 
the first cDNA strand for priming of the second cDNA strand and thereby 
avoiding any interference from the homopolymeric tan with subsequent analysis 
of the isolated gene and its product. In another aspect the method according to 
the invention allows the cloning and sequencing of selected mRNAs, so that the 
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investigator may determine the relative desirability of the gene prior to screening 
a comprehensive cDNA library for the full length gene product. 

Description of the Preferred Embodiments 

Drawings 

5 Fig, 1 is a schematic representation of the method according to the 

invention. 

Fig. 2 is the sequence of the 3' end of the Nl gene ficom normal mouse 
fibroblast cells (A31) [Seq. ID. No. 9]. 

Fig. 3 is the Northern blot of the Nl sequence on total cellular RNA from 
10 normal and tumorigenic mouse fibroblast cells. 

Fig. 4 is a sequencing gel showing the results of amplification for mRNA 
prepared fiom four sources (lanes 1-4), using the Kozak primer alone, ±e AP-1 
primer alone, the Kozak and AP-1 primers, the Kozak and AP-2 primers, the 
Kozak and AP-3 primers, the Kozk and AP-4 primers and the Kozak and AP-5 
15 primers. This gel will be more fully described later. 

Fig. 5 is a partial sequence of the 5' end of a clone, Ki, that was cloned 
firom the Al-5 cell line that was cultured at the non-permissive temperature and 
then shifted to the permissive temperature (32.5 ''C) for 24 h prior to the 
preparation of the mRNA. The Al-5 cell line is from a primary rat embryo 
20 fibroblast cell line that has been doubly transformed with ras and a temperature 
sensitive mutation of P^^ ("P^^"""). 
General Description. Development of the Method 

By way of illustration a description of examples of the method of the 
invention follows, with a description by way of guidance of how the particular 
25 illustcative examples were developed. 

It is important for operation of the method that the length of the 
oligodeoxynucleotide be appropriate for specific hybridization to mRNA. In 
order to obtain specific hybridization, whether for conventional cloning methods 
or PGR, oligodeoxynucleotides are usually chosen to be 20 or more nucleotides 
30 in length. The use of long oligodeoxynucleotides in this instance would decrease 
the number of mRNAs identified during each trial and would greatiy increase the 
number of oligodeoxynucleotides required to identify every mRNA. Recentiy, it 
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was demonstrated that 9-10 nucleotide primers can be used for DNA 
polymorphism analysis by PGR (Williams et al, 1991, Nuc. Adds Res., Vol. 
18, pp. 6531-6535). 

The plasmid containing the cloned murine thymidine kinase gene ("TX 
cDNA plasmid") was used as a model template to detennine the required lengths ' 
of oligodeoxynucleotides for specific hybridization to a mRNA, and for the 
production of specific PGR products. The oligodeoxynucleotide primer chosen to 
hybridize internally in the mRNA was varied between 6 and 13 nucleotides in 
length, and the oUgodeoxynucleotide primer chosen to hybridize at the upstream 
end of the polyA tail was varied between 7 and 14 nucleotides in length. After 
numerous trials with different sets and lengths of primers, it was determined that 
the annealing temperature of 42»C is optimal for product specificity and the 
internally hybridizing oligodeoxynucleotide should be at least 9 nucleotides in 
length and a oUgodeoxynucleotide that is at least 13 nucleotides in length is 
required to bind to the upstream end of the polyA tail. 

With reference now to Fig. 1, the mediod according to the invention is 
depicted schematicaUy. The mRNAs are mixed with the first primer, for 
example 1 1 in iTmTVN [Seq. ID. No. 2] CTnVN) 1, and revene 
transcribed 2 to make the first cDNA strand. Hie cDNA is amplified as foUows. 
The first cDNA strand is added to the second primer and the first primer and the 
polymerase in the standard buffer with the appropriate concentrations of 
nucleotides and the components are heated to 94"'C to denature the mRNArcDNA 
hybrid 3, the temperature is reduced to 42"'C to allow the second primer to 
anneal 4, and then the temperature is increased to 72»C to allow the polymerase 
to extend the second primer 5. TTie cycling of the temperature is then repeated 
6, 7, 8, to begin the amplification of the sequences which are hybridized by the 
first and second primers. The temperature is cycled until the desired number of 
copies of each sequence have been made. 

As is weU known in the art, this amplification method can be accomplished 
using thermal stable polymerase or a polymerase that is not thermal stable. 
When a polymerase that is not thermal stable is used, fiesh polymerase must be 
added after the annealing of the primers to the templates at the start of the 
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elongation or extending step, and the extension step must be carried out at a 
temperature that is permissible for the chosen polymerase. 

The foUowing examples of the method of the invention are presented for 
iUustrative puiposes only. As will be qjpreciated. the method according to the 
invention can be used for the isolation of polyA mRNA from any source and can 
be used to isolate genes expressed either differentially or constitutively at any 
level, from rare to abundant. 

Examplft 1 

Ejcperimentation with the conditions required for accurate and reproducible 
results by PGR were conducted with the TK cDNA plasmid and a single set of 
oligodeoxynucleotide primers; the sequence lllllliTiTiC A ("TuCA") [Seq 
ID. No. 10] wa§ chosen to hybridize to the upstream end of the polyA tail and 
the sequence CTTGATTGCC CTtkS") [Seq. ID. No. 11] was chosen to 
hybridize 288 base pairs ("bp") upstream of the polyA tail. The expected 
fragment size using these two primers is 299 bp. 

PGR was conducted under standard buffer conditions weU known in the art 
with 10 ng TK cDNA plasmid (buffer and polymerase are available ftom Perkin 
Hmer-Cetus). The standard conditions were altered in that the primers were 
used at concentrations of 2.5 iM T^CA [Seq. ID. No. 10] , 0.5 ^tM Ltk3 [Seq. 
ID. No. 11] , instead of 1 of each primer. The concentration of the 
nucleotides ("dNTPs") was also varied over a 100 fold range, from the standard 
200 AiM to 2 /iM. The PGR parameters were 40 cycles of a denaturing step for 
30 seconds at 94X, an annealing step for 1 minute at 42'C, and an extension 
step for 30 seconds at 72''C. Significant amounts of non-specific PGR products 
were observed when the dNTP concentration was 200 /*M, concentrations of 
dNTPs at or below 20 ^tM yielded specifically amplified PGR products. The 
specificity of the PGR products was verified by restriction endonuclease digest of 
the amplified DNA, which yielded the expected sizes of restriction fiagments. In 
some instances it was found fliat the use of up to 5 fold more of the first primer 
than the second primer also functioned to increase the specificity of the product. 
Lowering the dNTP concentration to 2 /zM aUowed the labelling of the PGR 
products to a high spedfic activity with [a-^^S] dATP, 0.5 /xM [a-^^S] dATP (Sp. 
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Act. 1200 Ci/mraol), which is necessary for distinguishing the PGR products 
when resolved by high resolution denaturing polyacrylamide gel electrophoresis, 
in this case a DNA sequencing gel. 
Example 2 

The PGR method of amplification with short oligodeoxynucleotide primers ' 
was then used to detect a subset of mRNAs in mammalian ceUs. Total RNAs 
and mRNAs were prepared from mouse fibroblasts cells which were either 
growing normally, "cycling", or serum starved, "quiescent". The RNAs and 
mRNAs were reverse transcribed with TnCA [Seq. ID. No. 10] as the primer. 
The TnCA primer [Seq. ID. No. 10] was annealed to the mRNA by heating the 
mRNA and primer together to 65 "C and allowing the mixture to gradually cool 
to 35 "C. The reverse transcription reaction was carried out with Moloney 
murine leukemia vims reverse transcriptase at 35°C. The resultant cDNAs were 
amplified by PGR in the presence of T^GA [Seq. ID. No. 10] and Ltk3 [Seq. 
ID. No. 11] , as described in Example 1, using 2 /tM dNTPs. The use of the 
T„GA [Seq. ID. No. 10] and Ltk3 [Seq. ID. No. 11] primers allowed the TK 
mRNA to be used as an internal control for differential expression of a rare 
mRNA transcript; TK mRNA is present at jqjproximalely 30 copies per cell. 
The DNA sequencing gel revealed 50 to 100 amplified mRNAs in the size range 
which is optimal for further analysis, between 100 to 50Q nucleotides. The 
patterns of the mRNA species observed in cycling and quiescent cells were very 
similar as expected, though some differences were apparent. Notably, the TK 
gene mRNA, which is expressed during Gl and S phase, was found only in the 
RNA preparations from cycling cells, as expected, thus demonstrating the ability 
of this method to separate and isolate rare mRNA species such as TK. 
Example 3 

The expression of mRNAs in normal and tumorigenic mouse fibroblast 
cells was also compared using the T„GA [Seq. ID. No. 10] and Ltk3 [Seq. ID. 
No; 11] primers for the PGR amplification. The mRNA was reverse transcribed 
using TiiCA [Seq. ID. No. 10] as the primer and the resultant cDNA was 
amplified by PGR using 2 fjM dNTPs and the PGR parameters described above. 
The PGR products were separated on a DNA sequencing gel. The TK mRNA 
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was present at the same level in both the nonnal and tumorigenic mRNA 
preparations, as expected, and provided a good internal control to demonstrate 
the rq)resentation of rare mRNA species. Several other bands were present in 
one preparation and not in the other, with a few bands present in only the mRNA 
ftom normal cells and a few bands present only in the mRNA ftom the 
tumorigenic cells; and some bands were expressed to different levels in the 
normal and tumorigenic cells. Thus, the metiiod according to the invention can 
be used to identify genes which are normally continuously expressed 
(constitutive), and differentially expressed, suppressed, or otherwise altered in 
their level of expression. 
Cloning of the mRNA identified in Example '^ 

Three cD^As that are, the TK cDNA, one cDNA expressed only in 
normal cells ("Nl"), and one cDNA expressed only in tumorigenic cells ("Tl"), 
were recovered from the DNA sequencing gel by electroelution, ethanol 
predpitaled to remove the urea and other contaminants, and reamplified by PGR, 
in two consecutive PGR amplifications of 40 cycles each, with the primers T,iGA 
[Seq. m. No. 10] and Ld£3 [Seq. ID. No. 11] in the presence of 20 /tM dNTPs 
to achieve optimal yield witiiout compromising the specificity. The reamplified 
PGR products were confirmed to have the appropriate sizes and primer 
dependencies as an additional control the reamplified TK cDNA was digested 
with two separate restriction endonucleases and the digestion products were also 
confirmed to be of the correct size. 

The reamplified Nl [Seq. ID. No. 9] was cloned with the TA cloning 
system, Livitcogen Inc., into the plasmid pGRlOGO and sequenced. With 
reference now to Fig. 2, the nucleotide sequence clearly shows the Nl ftagment 
[Seq. m. No. 9] to be flanked by the underlined Ltk3 primer 15 at the 5' end 
and the underlined T^CA primer 16 at the 3' end as expected. 

A Northern analysis of total ceUular RNA using a radiolabelled Nl probe 
reconfirmed that the Nl mRNA was only present in the normal mouse fibroblast 
cells, and not in the tumorigenic mouse fibroblast cells. With reference now to 
Fig. 3, the probe used to detect the mRNA is labelled to the right of the figure, 
and the size of the Nl mRNA can be estimated from the 28S and 18S markers 
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depicted to the left of the figure. The Nl mRNA is present at low abundance in 
both exponentially growing and quiescent normal cells, lanes 1 and 3, and is 
absent from both exponentially growing or quiescent tumorigenic cells, lanes 2 
and 4. As a control, the same Northern blot was reprobed with a radiolabelled 
5 probe for 36B4, a gene that is expressed in both normal and tumorigenic cells, to 
demonstrate that equal amounts of mRNA, lanes 1-4, were present on the 
Northem blot. 
Example 4 

The comparison of the expression of mRNAs in three cell lines, one of 

10 which was tested after culturing under two different conditions, was conducted. 
The cell lines were a primary rat embryo fibroblast cell line ("REF"), the KEF 
cell line that has been doubly transformed with las and a mutant of P^^ ("TlOl- 
4"), and the REF cell line that has been doubly transformed with ras and a 
temperature sensitive mutation of P^^ ("Al-5"). The Al-5 cell line was cultured 

15 at the non-permissive temperature of 37oC, and also cultured at 37oC then 

shifted to the permissive temperature of 32.5oC for 24 h prior to the preparation 
of the mRNA. The method of the invention was conducted using the primers 
"Kozak" and one of five arbitrary sequence primers, *AP-1, AP-2, AP-3. AP-4, 
or AP-5", as the second and first primers, respectively. 

20 The sequence of the "Kozak" primer was chosen based upon the published 

consensus sequence for the translation start site consensus sequence of mRNAs 
(Kozak, 1991, Jour. Cell Biology, Vol. 115, pp. 887-903). A degenerate Kozak 
primer having sequences substantially identical to the translation start site 
consensus sequence were used simultaneously, these sequences were 5'- 

25 GCCRCCATGG [Seq. ID No. 12], in which ±e R is dA or dG and thus the 
oligodeoxynucleotide primer has only one of the given nucleotides which results 
in a mixture of primers. 

The sequence of the five arbitrary primers was a follows: AP-1 had the 
sequence 5'-AGCCAGCGAA [Seq. ID. No. 13]; AP-2 had the sequence 

30 5*"GACCGCTTGT [Seq. ID. No. 14]; AP-3 had the sequence 5'- 

AGGTGACCGT [Seq. ID. No. 15]; AP-4 had the sequence 5'-GGTACTCCAC 
[Seq. ID. No. 16]; and AP-5 had the sequence 5'-GTTGCGATCC [Seq. ID. No. 
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17]. These aibitrary sequence primers were chosen arbitrarily. Jn general each 
aibittary sequence primer was chosen to have a GC content of 50-70%. 

The mENA was reverse transcribed using one of the AP primers, as the 
first primer, and the resultant first cDNA strand was amplified in the presence of 
both primers, the AP primer and the degenerate Kozak primer, by PGR using 2 
fiM. NTPs and the PGR parameters described above. The PGR products were 
separated on a DNA sequencing gel. At least 50-100 amplified cDNA bands 
were present in each of the ceE lines tested, and some bands were expressed to 
different levels in the different cell lines. As a control a reaction was conducted 
using each arbitrary primer in the absence of the Kozak primer. No cDNA was 
generated by the arbitrary primer alone, thus demonstrating that both primers 
were required to anplify an mRNA into a cDNA 

With reference now to Fig. 4, the primer sets used for each reaction are 
shown at the top of the Fig. along the line marked Primers . As a control a 
reaction was conducted using die primers in the absence of mRNA, and using 
AP-1 with mRNA in the absence of the Kozak primer. No cDNA was generated 
by the primers in the absence of mRNA or by the arbitrary primer alone, thus 
demonstrating that mRNA is required for amplification and that both primers 
were required to amplify an mRNA into a cDNA. The cDNA products of the 
amplification were loaded m the same order across the gel, flius the REF cell line 
is shown in each of lanes 1, cell line TlOl-4 is shown in each of lanes 2, cell 
line Al-5 cultured at 37oG is shown in each of lanes 3, and cell line Al-5 
cultured at 32.5oC is shown in each of lanes 4. Each pair of primers resulted in 
the amplification of a different set of mRNAs from the cell lines. The reactions 
which were conducted using the Kozak primer and any of primers AP-1, AP-2, 
AP-4, or AP-5 as a primer set resulted in the amplification of the same dDNA 
pattern from each of cell lines REF, TlOl-4, Al-5 cultured at 37oG and Al-5 
cultured at 32.5oG. The amplification of mRNA firom each cell line and 
temperature using the Kozak degenerate primer and the AP-3 primer resulted in 
the finding of one band in particular which was present in the mRNA prepared 
from the Al-5 cell line when cultured at 32.5oG for 24 h, and not in any of the 
other mRNA preparations, as can be seen in Fig. 4 designated as Ki. Thus the 
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method according to the inventioii may be used to identify genes which are 
differentially ejcpressed in mutant cell lines. 
Cloning of the n^PMA identified in Example 4 

The cDNA ("Ki") that was expressed only in the Al-5 cell line when 
cultured at 32.50C was recovered from the DNA sequencing gel and reampMed 
using the primers Kozak and AP-3 as described above. The reamplified Ki 
cDNA was confirmed to have the appropriate size of approximately 450 bp, and 
was cloned with the TA cloning system, Invitrogen Inc., into the vector pCRH 
(Invitrogen, Inc.) according to the manufacturers instractions, and sequenced. 
With reference now to Fig. 5, the nucleotide sequence cleariy shows the Ki 
clone to be flanked by the underlined Kozak primer 20 at the 5' end and the 
underlined AP-3 primer 21 at the 3' end as eKpected. The 5 ' end of this partial 
CDNA is identified in Seq. ID No. 18, and the 3' end of this cDNA is identified 
in Seq. ID No. 19. This partial sequence is an open reading frame, and a search 
of the gene databases EMBO and Genbank has revealed the translated amino acid 
sequence from the 3' portion of Kl to be homologous to the ubiquitin conjugating 
enzyme family (UBC enzyme). The translated amino acid sequence of the 3' 
portion of Kl is 100% identical to a UBC enzyme from D. melanogaster and 
75% identical to the UBC-4 enzyme and 79% identical to the UBC-5 enzyme 
from the yeast S. saccharomyces; and 75% identical to the UBC enzyme from 
Arabidopsis thaUana. The Kl clone may contain the actual 5' end of this gene, 
otherwise the Kozak primer hybridized just after the 5' end. This result 
demonstrates that the method according to die invention can be used to clone the 
5' coding sequence of a gene 
Use 

The method according to the invention can be used to identify, isolate and 
clone mRNAs from any number of sources. The method provides for the 
identification of desirable mRNAs by simple visual inspection after separation, 
and can be used for investigative research, industrial and medical appKcations. 

For instance, the reamplified cDNAs can be sequenced, or used to screen a 
DNA library in order to obtain the fiiU length gene. Once the sequence of the 
cDNA is known, amino acid peptides can be made from the translated protein 
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sequence and used to raise antibodies. These antibodies can be used for further 
research of tiie gene product and its function, or can be applied to medical 
diagnosis and prognosis. The reamplified cDNAs can be cloned into an 
appropriate vector for further propagation, or cloned into an appropriate 
5 expression vector in order to be expressed, either in vitro or in vivo. The 

cDNAs which have been cloned into e3q)ression vectors can be used in industrial 
situations for overproduction of the protein product. In other applications the 
reamplified cDNAs or their respective clones will be used as probes for in situ 
hybridization. Such probes can also be used for the diagnosis or prognosis of 
10 disease. 

Other Embodiments 
Other embodiments are within the following claims. 
The length of the oligodeoxynucleotide can be varied dependent upon tiie 
annealing temperature chosen. In the preferred embodiments the temperature 
15 was chosen to be 42^C and the oligonucleotide primers were chosen to be at 
least 9 nucleotides in length. If the annealing temperature were decreased to 
35°C then the oligonucleotide lengths can be decreased to at least 6 nucleotides 
in length. 

The cDNA could be radiolabelled with radioactive nucleotides other than 
20 such as 33p^ When desired, non-radioactive imaging methods can 

also be applied to the method according to the invention. 

The amplification of ±e cDNA could be accomplished by a temperature 
cycling polymerase chain reaction, as was desaibed, using a heat stable DNA 
polymerase for the repetitive copying of the cDNA while cycling the temperature 
25 for continuous rounds of denaturation, amealing and extension. Or the 

amplication could be accomplished by an isothermal DNA amplification method 
(Walker erflZ., 1992, Proc, Natl Acad. 5d, , Vol. 89, pp. 392-396). The 
isothermal amplification method would be adapted to use for amplifying cDNA 
by including an appropriate restriction endonuclease sequence^ one that will be 
30 nicked at hemiphosphorothioate recognition sites and whose recognition site can 
be regenerated during synthesis with c^^S labelled dNTPs. 
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Proteins having similar function or similar functional domains are often 
referred to as being part of a gene family. Many such proteins have been cloned 
and identified to contain consensus sequences which are highly conserved 
amongst the members of the family. This conservation of sequence can be used 
5 to design oligodeoxynucleotide primers for the cloning of new members, or 
related members, of a family. Using the method of the invention the mRNA 
from a cell can be reverse transcribed, and a cDNA could be amplified using at 
least one primer that has a sequence substantially identical to the sequence of a 
mRNA of known sequence. Consensus sequences for at least the following 

10 families and functional domains have been described in the literature: protein 
tyrosine kinases (Hanks et aZ., 1991, Methods on Enzymology, Vol. 200, pp. 38- 
81; Wilks, 1991, Methods in Enzymology, Vol. 200, pp. 533-546); homeobox 
genes; zinc-finger DNA bindmg proteins (Miller et al , 1985, EMBO Jour. , Vol. 
4, pp. 1609-1614); receptor proteins; the signal peptide sequence of secreted 

15 proteins; proteins that localize to the nucleus (Guiochon-Mantel et al , 1989, 
Vol. 57, pp. 1147-1154); serine proteases; inhibitors of serine proteases; 
cytokines; the SH2 and SH3 domains that have been described in tyrosine kinases 
and otiier proteins (Pawson et al, 1992, Cell, Vol 71, pp. 359-362); 
serine/threonine and tyrosine phosphatases (Cohen, 1991, Methods in 

20 Enzymology, Vol. 201, pp. 398-408); cyclins and cyclin-dppendent protein 
kinases (CDKs) (see for ex., Keyomarsi et aZ., 1993, Proc. Natl. Acad. Sci., 
USA, Vol. 90, pp. 1112-1116). 

Primers for any consensus sequence can readily be designed based upon the 
codon usage of the amino acids. The incorporation of degeneracy at one or more 

25 sites allows the designing of a primer which will hybridize to a high percentage, 
greater than 50%, of the mRNAs containing the desired consensus sequence. 

Primers for use in the method according to the invention could be designed 
based upon the consensus sequence of the zinc finger DNA binding proteins, for 
example, based upon the amino acid consensus sequence of the proteins PYVC, 

30 Useful primers for the cloning of further members of this family can have the 
following sequences: 5'-GTAYGCNTGT [Seq. ID. No. 20] or 5'- 
GTAYGCNTGC [Seq. ED. No. 21], in which the Y refers to tiie 
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deoxynucleotides dT or dC for which the primer is degenerate at this position, 
and the N refers to inosine ("I"). The base inosine can pair with all of the 
other bases, and was chosen for this position of the oligodeoxynucleotide as the 
codon for valine "V" is highly degenerate in this position. The described 
oligodeoxynucleotide primers as used will be a mixture of 5*-GTATGCITGT 
and 5'-GTACGCITGT or a mixture of S'-GTATGCTTGC and 
5'-GTACGCrrGC. 
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SEQUENCE LISTING 

(1) GENERAL INFORBIATION: 

(i) APPLICANT: Liang, Peng 

Paxdee, Arthur B. 
(ii) TITLE OF INVENTION: Identifying, Isolating and Cloning 
Messenger RNAs 
(iii) NUMBER OF SEQUENCES: 21 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Choate, Hall & Stewart 

(B) STREET: Exchange Place, 53 State' Street 

(C) CITY: Boston 

(D) STATE: Massachusetts 

(E) COUNTRY: U.S.A. 

(F) ZIP: 02190 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 
(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 11 -MAR- 1993 

(C) CLASSIFICATION: 
(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/850,343 

(B) FILING DATE: ll-MAR-1992 
(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Pastemack, Sam 

(B) REGISTRATION NUMBER: 29,576 

(C) REFERENCE/DOCKET NUMBER: DFCI234CIP 
(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617 227-5020 

(B) TELEFAX: 617 227-7566 

(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
' ITiTriTiTi ' TVN 

(2) INFORMATION FOR SEQ ID NO: 2: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECDLE TYPE: other nucleic acid 
Ciii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2r 
TTTTTTTTTT TTV 

(2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3; 
TTTTTTTTTT VNN 

(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
NNTTTATTNN 

(2) INFORMATION FOR SEQ ID N0:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:5: 
GCTTTATTNC 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRAKDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO # 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
NNNANNATGN 

10 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
NNNANNATGG 

10 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GCCACCATGG 

10 

(2) INFORMATION FOR SEQ ID N0:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 260 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CTTGATTGCC TCCTACAGCA GTTGCAGGCA CCTTTAGCTG TACCATGAAG TTCACAGTCC 60 
GGGATTGTGA CCCTAATACT GGAGTTCCAG ATGAAGATGG ATATGATGAT GAATATGTGC 120 
TGGAAGATCT TGAGGTAACT GTGTCTGATC ATATTCAGAA GATACTAAAA CCTAACTTCG 180 
CTGCTGCCTG GGAAGAGGTG GGAGGAGCAG CTGCGACAGA GCGTCCTCTT CACAGAGGGG 240 
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TCCTGGGTGA AAAAAAAAAA 
(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
TTTTTTTTTT TCA 

(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) -STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GTTGATTGCC 

(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) IKDLECUIiE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
{iv> ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GCCRCCATGG 

(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
AGCCAGCGAA 
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(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
GACCGCTTGT 

(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
AGGTGACCGT 

(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16; 
GGTACTCCAC 

(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

. (C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GTTGCGATCC 

(2) INFORMATION FOR SEQ ID NO: 18: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GCCGCCATGG CTCTGAAGA6 AATCCACAAG GACACCCATG AA 42 
(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
Cii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
Civ) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:19r 
GTTGCATTTA CAACAAGAAT TTATCATCCA AATATTAACA GTAATGGCAG CATTTGTCTr SO 
GATATTCEAC GGTCACCT 
(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

. (C) STRANDEDNESS: single 
(D) 'TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GTAYGCNTGT 

(2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GTAYGCNTGC 
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Claims 

1. A non-specific cloning method for isolating in a nucleic acid sample a 
DNA complementaiy to a mRNA, comprising 

contacting the mRNA with a fiist oligodeoxynucleotide under conditions in 
which said first oHgodeoxynucleotide hybridizes with mKNA at a site including a . 
sequence immediately upstream of a first A ribonucleotide of the mRNA's polyA 
tail, 

reverse transcribing the mRNA using a reverse transcriptase, using said 
first oligodeoxynucleotide as a primer, to produce a first DNA strand 
complementary to at least a portion of the mRNA upstream from the site of 
hybridization of said first oligodeoxynucleotide with the mRNA, 

contacting the first DNA strand with a second oligodeoxynucleotide under 
conditions in which said second oligodeoxynucleotide hybridizes with DNA, 

extending the second oligodeoxynucleotide using a DNA polymerase to 
produce a second DNA strand complementary to the first DNA strand 
downstream from the site of hybridization of said second oligodeoxynucleotide 
with said first DNA strand, and 

amplilying the first and second DNA strands using a DNA polymerase, 
using said first and second oligodeoxynucleotides as primers. 

2. The method of claim 1 wherein said first oligodeoxynucleotide 
hybridizes with the mRNA at a site that includes at least one base upstream from 
and adjacent to the first A ribonucleotide of the polyA tail. 

3. The method of claim 2 wherein said first oligodeoxynucleotide 
hybridizes with the mRNA at a site that includes at least two bases upstream 
ftom and adjacent to the first A ribonucleotide of the polyA tail. 

4. The method of claim 1 wherein said first oligodeoxynucleotide 
includes a polyA-complementary region comprising at least 11 bases and, 
upstream from said polyA-complementary region, a non-poly-A complementary 
region comprising at least one base. 

5. The method of claim 4 wherein said non-polyA-complementary 
region comprises at least 2 contiguous bases 
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6. The method of claim 5 wherein said non-polyA-complementaiy 
region comprises 3'-NV, wherein V is one of dA, dC or dG, and N is one of 
dA,dT, dCordG. 

7. The method of claim 4 whereia said first oligodeoxynucleotide 
comprises at least 13 bases. 

8. Tlie method of claim 1 wherein said second oligodeoxynucleotide 
comprises at least 6 deoxyribonucleotides. 

9. The mediod of claim 1 wherein said second oUgodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

10. The mediod of claim 1 wherein said second oUgodeoxynucleotide 
includes a randomly selected nucleotide sequence. 

11. The method of claim 1 wherein said first or second 
oligodeoxynucleotide includes a selected arbitrary sequence. 

12. The method of claim 1 wherein said first or the second 
oligodeoxynucleotide includes dC, dG, dT and dA. 

13. The method of claim 1 wherein said first or second 
ofigodeoxynucleotide includes a restriction endonuclease recognition sequence. 

14. The method of claim I wherein said second oUgodeoxynucleotide 
includes a sequence identical to a sequence contaLaed within a mRNA of known 
sequence. 

15. The method of claim 1 wherein at least one of said first or second 
oUgodeoxynucIeotides comprises apluraUty of oUgodeoxynucleotides. 

16. A non-specific cloning method for isolating in a nucleic acid sample a 
DNA complementary to a mRNA, comprising 

contacting the mRNA with a first oUgodeoxynucleotide under conditions in 
which said first oUgodeoxyuucleotide hybridizes widi mRNA at a site that 
includes the mRNA's polyA signal sequence, 

reverse transcribing the mRNA using a reverse transcriptase, using said 
first oUgodeoxynucleotide as a primer, to produce a first.DNA strand 
complementary to at least a portion of the mRNA upstream from the site of 
hybridization of said first oUgodeoxynucleotide with the mRNA, 
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contacting the first DNA strand with a second oligodeoxynucleotide under 
conditions in which said second oligodeoxynucleotide hybridizes with DNA, 

extendmg the second oligodeoxynucleotide using a DNA polymerase to 
produce a second DNA strand complementary to the first DNA strand 
downstream from the site of hybridization of said second oHgodeoxynucleotide 
with said first DNA stand, and 

amplifying the first and second DNA strands using a DNA polymerase, 
using said first and second oligodeoxynucleotides as piimeis. 

17. The method of claim 16 wherein said first oligodeoxynucleotide 
comprises at least 6 nucleotides. 

18. The method of claim 16 wherein said first oligodeoxynucleotide 
comprises at least 9 nucleotides. 

19. The method of claim 16 wherein said second oligodeoxynucleotide 
comprises at least 6 deoxyribonucleotides. 

20. The method of claim 16 wherein said second oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

21. The method of claim 16 wherein said second oligodeoxynucleotide 
includes a randomly selected nucleotide sequence. 

22. The method of claim 16 wherein said first or second 
oUgodeoxynucleotide includes a selected aibitraiy sequence. 

23. The metiiod of claim 16 wherein said first or the second 
oligodeoxynucleotide includes dC, dG, dT and dA. 

24. The method of claim 16 wherein said first or second 
oligodeoxynucleotide includes a restriction endonuclease recognition sequence. 

25. The method of claim 16 wherein said second oligodeoxynucleotide 
includes a sequence identical to a sequence contained within a mRNA of known 
sequence. 

26. The method of claim 16 wherein at least one of said first or second 
oligodeoxynucleotides comprises a pluraUty of oligodeoxynucleotides. 

27. A method for isolating in a nucleic acid sample a DNA 
complementary to a mRNA, comprising 
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contacting the mENA with a first oligodeoxynucleotide under conditions in 
which said first oligodeoxynucleotide hybridizes with mENA at a site, 

reverse transcribing the mRNA using a reverse transcriptase, using said 
first oligodeoxynucleotide as a primer, to produce a first DNA strand 
complementary to at least a portion of the mRNA upstream from said site of 
hybridization of said first oligodeoxynucleotide with the mRNA, 

contacting the first DNA strand with a second oligodeoxynucleotide under 
conditions in which said second oligodeojqrnucleotide hybridizes with the DNA 
strand at a site, said site including a Kozak sequence, 

extending the second oligodeoxynucleotide using a DNA polymerase to 
produce a second DNA strand complementary to the first DNA strand 
downstream fiom the site of hybridization of said second oIigodeo;q?nucleotide 
with said first DNA strand, and 

amplifying the first and second DNA strands using a polymerase, using 
said first and second oligodeojqmucleotides as primers. 

28. The method of claim 27 wherein said first oligodeoxynucleotide 
comprises at least 9 deoj^ribonucleotides. 

29. The method of claim 27 wherein said first oligodeoxynucleotide 
comprises 10 deoxyribonucleotides. 

30. The method of claim 27 wherein said second oligodeoxynucleotide 
comprises at least 9 deo:^bonucleotides. 

31. The method of claim 27 wherein said second oligodeoxynucleotide 
comprises 10 deoxsrribonucleotides. 

32. The method of claim 27 wherein said first oligodeoxynucleotide is 
composed of a randomly selected sequence of deoxyribonucleotides. 

33. The method, of claim 27 wherein said first oligodeoxynucleotide 
includes a selected arbitrary sequence of deoxyribonucleotides. 

34. The method of claim 27 wherem said first oligodeoxynucleotide 
iacludes a restriction endonuclease recognition sequence. 

35. The method of claim 27 wherein said first oligodeoxynucleotide 
includes a sequence substantially identical to a sequence contained within an 
mRNA of known sequence. 
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36. The method of claim 27 wherein said second oligodeoxynucleotide 
further includes a restriction endonuclease sequence. 

37. The method of claim 27 wherein at least one of said first or second 
oligodeoxynucleotides comprises a plurality of oligodeoxynucleotides. 

38. A non-specific cloning method for isolating in a nucleic acid sample 
a DNA complementary to a mRNA, comprising 

contacting the mRNA with a first oligodeoxynucleotide, having a base 
sequence substantially complementary to a sequence in a mRNA of known 
sequence, under conditions in which said first oligodeoxynucleotide hybridizes 
with mRNA at a site having said substantially identical sequence, 

reverse transcribing the mRNA using a reverse transcriptase, using said 
first oligodeoxynucleotide as a primer, to produce a first DNA strand 
complementary to at least a portion of the mRNA upstream from said site of 
hybridization of said first oligodeoxynucleotide with the mRNA, 

contacting the first DNA strand with a second oligodeoxynucleotide under 
conditions in which said second oligodeoxynucleotide hybridizes with the DNA 
strand at a site, 

extending the second oligodeoxynucleotide using a DNA polymerase to 
produce a second DNA strand complementary to the first DNA strand 
downstream from said site of hybridization of said second oligodeoxynucleotide 
with said first DNA strand, and 

amplifying the first and second DNA strands using a polymerase, using 
said first and second oligodeoxynucleotides as primers. 

39. The method of claim 38 wherein said first oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

40. The method of claim 38 wherein said first oligodeoxynucleotide 
comprises 10 deoxyribonucleotides. 

41. The method of claim 38 wherein said second oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

42. The method of claim 38 wherein said second oligodeoxynucleotide 
comprises 10 deoxyribonucleotides. 
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43. The method of claim 38 wherein said first oligodeoxymicleotide 
further inchides a restriction endonuclease sequence. 

44. The method of claim 38 wherein said second oligodeoxynucleotide 
is composed of a randomly selected sequence of deoxyribonucleotides. 

45. The method of claim 38 wherein said second oligodeoxynucleotide 
includes a selected arbitrary sequence of deoxyribonucleotides. 

46. The method of claim 38 wherein the base sequence of said second 
oligodeoxynucleotide contains a restriction endonuclease recognition sequence. 

47. The method of claim 38 wherein at least one of said first or second 
oligodeoxynucleotides comprises a plurality of oligodeoxynucleotides. 

48. A non-specific cloning method for isolating in a nucleic acid sample 
a DNA complementary to a mRNA, comprising 

contacting the mRNA with a first oligodeoxynucleotide under conditions in 
which said first oligodeoxynucleotide hybridizes with mRNA at a site, 

reverse transcribing the mRNA using a reverse transcriptase, using said 
first oligodeoxynucleotide as a primer, to produce a first DNA strand 
complementary to at least a portion of the mRNA upstream from said site of 
hybridization of said first oligodeoxynucleotide with the mRNA, 

contacting the first DNA strand with a second oligodeoxynucleotide, having 
a sequence substantially identical to a sequence in a mRNA of known sequence, 
under conditions in which said second oligodeoxynucleotide hybridizes with the 
first DNA strand at a site containing a complement of said substantially identical 
sequence, 

extending the second oligodeoxynucleotide using a DNA polymerase to 
produce a second DNA strand complementary to the first DNA strand 
downstream fix)m said site of hybridization of said second oligodeoxynucleotide 
with said first DNA strand, and 

amplifying the first and second DNA strands using a polymerase, using 
said first and second oligodeoxynucleotides as primers. 

49. The method of claim 48 wherein said first oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 
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50. The method of claim 48 wherein said first oligodeoxynucleotide 
comprises 10 deoxyribonucleotides. 

51. The method of claim 48 wherein said second oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

52. The method of claim 48 wherein said second oligodeoxynucleotide . 
comprises 10 deoxyribonucleotides. 

53. The method of claim 48 wherein said first oligodeoxynucleotide is 
composed of a randomly selected sequence of deoxyribonucleotides. 

54. The method of claim 48 wherein said first oligodeoxynucleotide 
includes a selected arbitrary sequence of deoxyribonucleotides. 

55. The method of claim 48 wherein said first oligodeoxynucleotide 
contains a restriction endonuclease recognition sequence. 

56. The method of claim 48 wherein said second oligodeoxynucleotide 
further includes a restriction endonuclease sequence. 

57. The method of claim 48 wherein at least one of said first or second 
oligodeoxynucleotides comprises a plurality of oligodeoxynucleotides. 

58. A non-specific cloning method for isolating in a nucleic acid sample 
a DNA complementary to a mRNA, comprising 

contacting the mRNA with a first oligodeoxynucleotide, having a sequence 
substantially complementary to a sequence in a mRNA of known sequence, under 
conditions in which said first oligodeoxynucleotide hybridizes with mRNA at a 
site containing said substantially identical sequence, 

reverse transcribing the mRNA using a reverse transcriptase, usiug said 
first oligodeoxynucleotide as a primer, to produce a first DNA strand 
complementary to at least a portion of the mRNA upstream ftom said site of 
hybridization of said first oligodeoxynucleotide with the mRNA, 

contacting the first DNA strand with a second oligodeoxynucleotide under 
conditions in which said second oligodeoxynucleotide hybridizes with the DNA 
strand at a site, said site including a Kozak sequence, 

extending tiie second oligodeoxynucleotide using a DNA polymerase to 
produce a second DNA strand complementary to the first DNA strand 
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downstream firom said site of hybridization of said second oligodeoxynucleotide 
with said first DNA strand, and 

amplifying the first and second DNA strands using a polymerase, using 
said first and second oligodeoxynucleotides as primers. 

59. The method of claim 58 wherein said first oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

60. The metiiod of claim 58 wherein said first pUgodeoxynucleotide 
comprises 10 deoxyribonucleotides. 

61 . The method of claim 58 wherein said second oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

62. The method of claim 58 wherein said second oligodeoxynucleotide 
comprises 10 dep^^bonucleotides. 



1/4 



5' 



■NBAAAAAAAAAAA- — An 



rTT-5' 



•NBAAAAAAAAAAA-— An 

-NvrnTTTrmr-s' 



\ 



NVIIIIIIIirTT-5' 



-NVTTTTTTTTTTT-5' 
5' -NNNNNNNNN 



5'-NNNNNNNNN 




— NVTTTTTTTTTTT-5' 




+ NVIIII 


mrrnr-s' 



5'-NNNNNNNNN- 



-NVTnTTTTTTTT-5' 
-NBAAAAAAAAAAA 



+ NVTnTTTTTTTT-5' 



5 '-NNNNNNNNN 
5 '-NNNNNNNNN 



-NVTTTTnTTTn-5' 
NVTIIIIIirTn-5' 

NBAAAAAAAAAAA 



5' -NNNNNNNNN 


— NVTTTTTTTTTTT-5 ' 


■5 '-NNNNNNNNN 


NVTTTrTnTnT-5' 
NBAAAAAAAAAAA 



8 



5'-NNNNNNNNN- 



-NVTrrrrnTTn-s' 

-NBAAAAAAAAAAA 



NNNNNNNNIi 
5'-NNNNNNNNN- 



-NyrmTTTTTTT-S' 
-NBAAAAAAAAAAA 



FIG. I 



2/4 



FIG. 2 

10 20 30 40 50 60 

CTTGATTGCC TCCTACAGCA GTTGCAGGCA CCTTTAGCTG TACCATGAAG TTCACAGTCC 

70 80 90 100 110 120 

GGGATTGTGA CCCTAATACT GGAGTTCCAG ATGAAGATGG ATATGATGAT GAATATGTGC 

130 140 150 160 170 180 

TGGAAGATCT TGAGGTAACT GTGTCTGATC ATATTCAGAA GATACTAAAA CCTAACTTCG 

190 200 210 220 230 240 

CTGCTGCCTG GGAAGAGGTG GGAGGAGCAG CTGCGACAGA GCGTCCTCTT CACAGAGGGG 

250 260 
TCCTGG GTGA AAAAAAAAAA 

Me 



FIG. 3 



3/4 



FIG. 4 




A /A 



5 ' -GCCACCATGGCTCTGAAGAGAATCCACAAGGACACCCATGAA 
Kozak 



GTTGCATTTACAACAAGAA 

TTTATCATCCAAATATTAACAGTAATGGCAGCATTTGTCTTGATATTCTACGGTCACCT-3' 



3'TGCCAGIGGA-5' 
AP-3 



FIG. 5 



SUBSTITUTE vSHEET 



PCTAJS93/02246 



I A. CLASSinCATION OF SUBJECT MATTER 
IPC(5) :C12P 19/34 
US CL :435/91 

I According to Intc rnaaoniil Patent Classification (IPC) or to both nat ional classification and IPC 
I B. nELDS SEARCHED 

Minimum documentation searched (classification system foUowed by classification symbols) 
U.S. : 435/6, 91; 935/21 



I Documentation searched other than minimum documentation to the extent that such documents 



are included in the fields searched 



Electronic data base consulted during the international search (name of data base and. where 
APS. CA, BIOSIS, MEDLINE 



practicable, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 

|y 

lY 



Citation of document, with indication, where appropr iate, of the relevant passages 

M. Innis et al. "PGR PROTOCOLS, A GUIDE TO MEraODS 
AND APPLIGATIONS", pubUshed 1990 by AGADEMIC PRESS 
ING.(GALIFORNIA), see pages 60-69. 

US, A, 4 683,195 (MULLIS ET AL) 28 JULY 1987, see column 6, 
lines 44-55, column 10, lines 47-57, claim 10. 

D. FREILFELDER, "MOLEGULAR BIOLOGY A 
COMPREHENSIVE INTRODUCTION TO PROKAR YQTES 'aND 
EUKARYOTES", pubUshed 1983 by JONES AND BARTLETT 
PUBLISHERS, Inc. (BOSTON), see pages 402-404. 



Further documents are listed in the continuation of Box 
Special categorici of cited documents: 

documctttdcfiamg die fenetil itite oftfae «ft whkb m aoccowkfettd 
to be put of pAiticuiirrekvaiice 

ctriicr documem pubUihod oa or iftcr djfi mtcmi^^ 

document whicfa may duow doubto on priority clun(i) or which is 
crted to erttbikh the publkatioo date SfaS^dSitic^ 
cpecalft«Ma(aaipcciAed) « ««r 

doamcot referring to u ord diKlosure, use. exhibHioa or other 



documeittpublkhiri prior to the mrrmatiooaifflh^ 

tfao pnonty date claimed •«w«»a 



^' ^} See patent femily annex. 



Relevant to claim No. 

1-62 

1-62 
16-26 



UUrdojwmtttpublbhedaf^ 

d«e aod not m cofifijct with the applkauioa but cited to undc^ 
pnnctple or dieoty uoderiying the toventioa 

docinnent of paiticular retevance; the claimed invention canooc be 
c<Mi«datd novel or canaoc be coniidend to invofve an inventive tteo 
wfacQ the document is tdcen alone 

document of paiticuhr relevance; die claimed invention cannot be 
coosideted to mvoh/A an inventive step wbca the document m 
eombmed wid» one or more other nich documents, such combinatkm 
bcmg obvKMatoapenoatiuUed tatheait 

documcatmcmber of the same patent fiunily 



Dale of the actual completion of the international search 


Dale of mailing of the international search report// 


29 April 1993 


17 MAY 1^^ 


1 Name and mailing addresa of the ISA/US 
1 Commisstoncr of Patciua and Trademariu 
1 Box PCX 

Washington, D.C. 20231 
1 Facsimile No. NOT APPLICABLE 


Authorized officer /// /I 


PAUL B. TRAN, PH.D. -^/f r J J/'/o " 
Telephone No. (703) 308-0196 - I 



